of Bayesian methods and time series techniques under the sponsorship of
this project. In this final report, we provide a brief summary of the main
results in the following principal areas of research: (i) analysis and
decomposition of seasonal time series, (ii) effect of temporal and
contemporal aggregation of time series, (iii) intervention techniques in
time series, (iv) modelling and analysis of multiple time series,
(v) asymptotic properties of least squares estimates in time series and
the extended autocorrelation function, (vi) pooling of time series,
(vii) effect of outliers in time series, (viii) science and statistics,
(ix) robustness in statistical analysis, (x) a unified theory of
statistical criticism and inference.

Throughout this report, we shall frequently refer to the following
univariate and multiple time series models. Let $\{Z_t\}$ be a series of
observations taken at equally spaced time intervals and following the
mixed autoregressive moving average model

    $\phi(B) z_t = \theta(B) a_t$    (1)

where $z_t = Z_t - \eta$, $\eta$ is a location parameter,
$\phi(B) = 1 - \phi_1 B - \dots - \phi_p B^p$ and
$\theta(B) = 1 - \theta_1 B - \dots - \theta_q B^q$ are polynomials in $B$
having zeros lying on or outside the unit circle, $B$ is the backshift
operator such that $B z_t = z_{t-1}$, and $\{a_t\}$ is a series of white
noise, identically and independently distributed as normal with zero mean
and common variance $\sigma_a^2$. When $k$ series $\{\mathbf{Z}_t\}$, where
$\mathbf{Z}_t = (Z_{1t}, \dots, Z_{kt})'$, are simultaneously considered, a
vector analog of (1) is the model

    $\boldsymbol{\phi}(B) \mathbf{z}_t = \boldsymbol{\theta}(B) \mathbf{a}_t$    (2)

where $\mathbf{z}_t = \mathbf{Z}_t - \boldsymbol{\eta}$, $\boldsymbol{\eta}$
is a vector of locations,
$\boldsymbol{\phi}(B) = I - \boldsymbol{\phi}_1 B - \dots - \boldsymbol{\phi}_p B^p$ and
$\boldsymbol{\theta}(B) = I - \boldsymbol{\theta}_1 B - \dots - \boldsymbol{\theta}_q B^q$
are matrix polynomials such that the determinantal polynomials
$|\boldsymbol{\phi}(B)|$ and $|\boldsymbol{\theta}(B)|$ have all zeros lying
on or outside the unit circle, and $\{\mathbf{a}_t\}$ are identically and
independently normally distributed with zero mean vector and covariance
matrix $\Sigma$.
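
As a minimal sketch of model (1), the recursion
$z_t = \phi_1 z_{t-1} + \dots + \phi_p z_{t-p} + a_t - \theta_1 a_{t-1} - \dots - \theta_q a_{t-q}$
can be written in a few lines of Python. The function names, the zero
pre-sample values and the default arguments are illustrative assumptions
and do not come from the report.

```python
import random


def arma_from_shocks(phi, theta, shocks, eta=0.0):
    """Build Z_t from model (1): phi(B) z_t = theta(B) a_t, z_t = Z_t - eta.

    Assumption: z_t and a_t are taken as zero before the first observation.
    """
    z = []
    for t, a_t in enumerate(shocks):
        # Autoregressive part: phi_1 z_{t-1} + ... + phi_p z_{t-p}
        ar = sum(phi[i] * z[t - 1 - i]
                 for i in range(len(phi)) if t - 1 - i >= 0)
        # Moving average part: theta_1 a_{t-1} + ... + theta_q a_{t-q}
        ma = sum(theta[j] * shocks[t - 1 - j]
                 for j in range(len(theta)) if t - 1 - j >= 0)
        z.append(ar + a_t - ma)
    return [eta + z_t for z_t in z]


def simulate_arma(phi, theta, n, eta=0.0, sigma_a=1.0, seed=0):
    """Draw n normal white-noise shocks and pass them through model (1)."""
    rng = random.Random(seed)
    shocks = [rng.gauss(0.0, sigma_a) for _ in range(n)]
    return arma_from_shocks(phi, theta, shocks, eta)
```

For example, `simulate_arma([0.5], [0.4], 200)` would give one realization
of an ARMA(1,1) series under these assumptions.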

Summary  of  Main  Results 

(i) Analysis and decomposition of seasonal time series

Economic and environmental time series often exhibit strong cyclical
or seasonal behavior. It has been argued that seasonality should be
removed from the series so that the underlying trend becomes more clearly
discernible. Suppose that an observable time series $Z_t$ can be written
additively as

    $Z_t = S_t + T_t + e_t$

where $S_t$ is the seasonal component, $T_t$ is the trend component and
$e_t$ is the irregular or noise component. A widely used method is the X-11
seasonal adjustment procedure empirically developed by the U.S. Bureau
of the Census over a number of years. It employs specific moving average
filters to estimate $S_t$ and $T_t$ from the data $Z_t$. The main weakness
of this procedure is that the underlying mechanism of $S_t$ and $T_t$ is not
clearly specified, which makes it difficult to check the appropriateness of
the method in any given application. In [1], we have found specific ARMA
models of the form in (1) for $S_t$, $T_t$ and $e_t$ for which the Census
filters are approximately optimal. The models for the components then imply
an overall model for $Z_t$ which can be verified from the data.
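
To make the idea of moving average filters for $S_t$ and $T_t$ concrete,
the following sketch applies a centered moving average to estimate the
trend and then averages the detrended values by season. This is a
simplified classical decomposition written for illustration only; it is
not the X-11 procedure, and the function names and the even-period
assumption are ours.

```python
def centered_ma_trend(series, period=12):
    """Centered moving average for an even period (half weights at both ends).

    Returns None where the window does not fit inside the series.
    """
    half = period // 2
    trend = [None] * len(series)
    for t in range(half, len(series) - half):
        window = series[t - half:t + half + 1]
        total = 0.5 * window[0] + sum(window[1:-1]) + 0.5 * window[-1]
        trend[t] = total / period
    return trend


def seasonal_means(series, trend, period=12):
    """Average the detrended values by season and center them to sum to zero."""
    sums = [0.0] * period
    counts = [0] * period
    for t, (z, tr) in enumerate(zip(series, trend)):
        if tr is not None:
            sums[t % period] += z - tr
            counts[t % period] += 1
    raw = [s / c for s, c in zip(sums, counts)]
    mean = sum(raw) / period
    return [r - mean for r in raw]
```

With a noise-free series made of a linear trend plus a fixed monthly
pattern summing to zero, both components are recovered exactly at interior
time points, which is a convenient sanity check before applying the filter
to real data.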

The work in [1] provides a link between an empirically based procedure
such as X-11 and procedures which are based on models of $Z_t$. Model based
decomposition and seasonal adjustment procedures are developed in [2],
[3], [4] and [5]. In this approach, an overall model for $Z_t$ of the form (1)
is first built from the data. Guided by the nature of the model, specific
assumptions are then made to decompose $Z_t$ into $S_t$, $T_t$ and $e_t$. A
canonical decomposition is proposed which minimizes the innovation variances
of $S_t$ and $T_t$. This decomposition is shown to be unique when it exists
and has a number of desirable properties. In [5] this procedure is extended
to take into account other relevant issues such as trading day and holiday
effects as well as removal of spurious observations. Our work has progressed
to the point that a large pilot experiment is planned at the Census
Bureau to compare the performance of the model based procedure with that of
X-11, for possible modification or replacement of the latter in the
future.

In modeling seasonal time series, a model of the form (1) has been
frequently found useful. It should, however, be noted that models of
this kind assume that the time dependent structure among the observations is
invariant to a shift in origin. This may not be valid in some situations.
For example, the variability of weather may differ from one season to
the next, or the correlation of the demand for a certain item between two
adjacent months may be higher in the summer than in the spring. In
[6], properties of a class of periodic models for characterizing such
series are explored, and the effects of misspecifying a homogeneous ARMA
model (1) for series having a periodic structure are studied.

(ii) Effect of temporal and contemporal aggregation of time series

The problem of temporal and contemporal aggregation of time series
data has attracted the interest of many econometricians. In [7], we have
studied the effect on forecasting of temporal aggregation of a univariate
series $Z_t$ following the model (1). It is shown that when the length of
aggregation is increased, the model for the aggregate approaches a limiting
form which depends on the multiplicity of the unit roots of the autoregressive
polynomial $\phi(B)$. This result is then exploited to compare the variances
of forecasts of the aggregates obtained using (i) the model for the aggregate
and (ii) the model for the basic series $Z_t$. The main conclusion is that
there is a substantial gain in using the basic series when it is
nonstationary. The implication is that it is worthwhile to use disaggregated
data, if available, in forecasting the aggregate when the series is
nonstationary.
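
A special case may help fix ideas. If the basic series is a random walk
(one unit root, no other structure), the first difference of its
non-overlapping $m$-period sums equals $(1 + B + \dots + B^{m-1})^2 a_t$
sampled every $m$ periods, and its lag-one autocorrelation is
$(m^2 - 1) / (2(2m^2 + 1))$, which tends to $1/4$ as $m$ grows. The sketch
below computes this from the filter coefficients. It illustrates the
random-walk case only and is not the general result of [7]; the function
names are ours.

```python
def aggregate(series, m):
    """Non-overlapping sums of m consecutive values (incomplete tail dropped)."""
    return [sum(series[i:i + m]) for i in range(0, len(series) - m + 1, m)]


def lag1_autocorr_diff_aggregate(m):
    """Lag-one autocorrelation of the differenced m-period aggregate
    of a random walk, computed from the weights of (1 + B + ... + B^(m-1))^2.
    """
    coeffs = [0] * (2 * m - 1)
    for i in range(m):
        for j in range(m):
            coeffs[i + j] += 1
    variance = sum(c * c for c in coeffs)
    # One step in aggregate time is m steps in basic time.
    covariance = sum(coeffs[k] * coeffs[k + m] for k in range(len(coeffs) - m))
    return covariance / variance
```

For quarterly sums of a monthly random walk (`m = 3`) the value is 4/19,
already close to the limiting value of 0.25.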

This work is extended in [8] to assess the effect of temporal
aggregation on the dynamic relationship between two time series variables.
We have found that temporal aggregation can turn a unidirectional transfer
function relationship into a feedback form. For example, suppose that at
the monthly level, consumption depends linearly on the income of the previous
month, and income follows a model of the form (1) independent of past
consumption. Thus a unidirectional relationship exists between them.
Nevertheless, it may be shown that for aggregated quarterly income and
consumption data, the relationship will no longer be unidirectional.
Recently, a number of econometricians have attempted to identify causality
with unidirectional transfer function relationships found in the data. Our
results suggest that extreme care must be exercised in any such
interpretation, since misleading conclusions can be drawn if the wrong
sampling interval is used.

In the problem of contemporal aggregation, we are interested in
studying the stochastic structure of a linear aggregate of several series
at the same time point, and in comparing forecasts made from past data on
the aggregate with those made from the component series. Our work in [9]
shows the relationship between the stochastic models for the aggregate and
the components, and obtains necessary and sufficient conditions under which
there will be no gain in accuracy from using the component series to
forecast the aggregate.

(iii) Intervention techniques in time series

In analyzing time series data, exogenous interventions often occur,
the effect of which on the level of the series is not necessarily
immediate. The impact of advertising on the sales of a certain product, or
that of the implementation of automobile exhaust control measures on air
quality, are examples of such interventions. While much is known about
the analysis of data of this kind when the errors are assumed independent
and the effects of intervention are immediate, methods developed under
such assumptions are not valid for time series data. In [10], it is shown
that dynamic effects of intervention may be well represented by difference
equation models, and serial dependence of errors may be characterized by
(1) (also a difference equation model). The theory and methods are
illustrated by two actual examples, and some further examples are given
in [11].
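
A first-order difference equation is perhaps the simplest instance of such
a dynamic model. The sketch below traces the response
$y_t = \delta y_{t-1} + \omega x_t$ to a step indicator $x_t$ that switches
from 0 to 1 at the onset of the intervention. The symbols $\omega$ and
$\delta$, the function names and the zero starting value are illustrative
assumptions, not notation taken from [10].

```python
def step_input(n, onset):
    """Indicator series: 0 before the intervention, 1 from `onset` onwards."""
    return [1.0 if t >= onset else 0.0 for t in range(n)]


def first_order_response(omega, delta, indicator):
    """Response y_t = delta * y_{t-1} + omega * x_t, starting from y = 0."""
    response = []
    previous = 0.0
    for x in indicator:
        previous = delta * previous + omega * x
        response.append(previous)
    return response
```

With `omega = 1.0` and `delta = 0.5` the response rises gradually towards
the eventual level change `omega / (1 - delta) = 2`; setting `delta = 0`
recovers the immediate effect assumed by classical methods.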

An alternative analysis of the effects of intervention is given in
[12], in which forecasts made prior to the onset of an intervention are
compared with actual observations since the intervention. This provides
a simple and natural way to formulate models for characterizing the dynamics
of the intervention. Finally, in [13], a detailed study is made of the
properties of parameter estimates of a first order dynamic model in
intervention analysis, showing that suitable reparameterization can
appreciably reduce the correlation between parameter estimates in
circumstances frequently met in practice.

(iv)  Analysis  and  modeling  of  multiple  time  series 

Economic, engineering and environmental time series data are often
available on several related variables of interest. One may wish to model
and analyze these series jointly in order to (i) understand the dynamic
relationships among these series and (ii) improve the accuracy of forecasts.
In the last few years, we have devoted considerable effort to developing an
iterative approach to modeling multiple time series. An extensive class
of models for such series is the vector ARMA models in (2), which allow
for the possibility of unidirectional as well as feedback relationships.

In [14], we have extended to the vector case the univariate time series
modeling approach proposed by Box and Jenkins in Time Series Analysis:
Forecasting and Control, Holden-Day (1970, 1976). A three stage model
building procedure consisting of (a) tentative identification, (b) estimation
and (c) diagnostic checking has been developed and tested on a number of
real examples. In particular, methods for tentative identification employ
cross correlations and partial autoregressions with simplifying indicator
symbols to aid the user's comprehension of the vast amount of information
contained in these measures. Parameter estimates are obtained by maximizing
the exact likelihood function, the properties of which are discussed in
[15].
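
The use of indicator symbols can be sketched as follows: each sample cross
correlation at a given lag is replaced by "+", "-" or "." according to
whether it is clearly positive, clearly negative or small. The cut-off of
$\pm 2/\sqrt{n}$ used here is an assumed rough two-standard-error rule, and
the function names are ours; the exact convention adopted in [14] is not
stated in this report.

```python
import math


def cross_correlation(x, y, lag):
    """Sample correlation between x_t and y_{t+lag}, for lag >= 0."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((x[t] - mean_x) * (y[t + lag] - mean_y)
              for t in range(n - lag)) / n
    sd_x = math.sqrt(sum((v - mean_x) ** 2 for v in x) / n)
    sd_y = math.sqrt(sum((v - mean_y) ** 2 for v in y) / n)
    return cov / (sd_x * sd_y)


def indicator_matrix(series, lag):
    """Matrix of '+', '-', '.' symbols for all pairs of series at one lag."""
    n = len(series[0])
    bound = 2.0 / math.sqrt(n)  # assumed cut-off, see the text above
    rows = []
    for x in series:
        row = []
        for y in series:
            r = cross_correlation(x, y, lag)
            row.append("+" if r > bound else "-" if r < -bound else ".")
        rows.append(row)
    return rows
```

Printing one such matrix per lag gives a compact picture of the pattern of
cross correlations that is much easier to scan than the numerical values.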

To aid in the understanding of the stochastic structure of multiple
time series, various principal component and canonical analyses have been
developed. In [16], we have considered linear transformations of the
vector $\mathbf{Z}_t$ in (2) to characterize the degree of predictability
of the data. Transformed series with a high degree of predictability
represent the growth characteristics in the data, while those with little
predictability represent stable relationships. Such analysis has been found
useful in uncovering interesting and meaningful substantive structures in
the series.

A computer package incorporating the iterative modeling approach
as well as the various principal component and canonical analyses has been
completed [17]. Over the last 2 years it has been distributed to nearly
200 industrial, academic, and governmental institutions throughout the world.

(v) Asymptotic properties of least squares estimates in time series and
extended autocorrelation function

For a stationary autoregressive model, i.e. when $q = 0$ in (1) and all
zeros of $\phi(B)$ lie outside the unit circle, it is well known that the
ordinary least squares (OLS) estimates of the autoregressive parameters
are consistent and asymptotically normally distributed. However, except
for certain special cases, asymptotic properties of the OLS estimates have
not been established in the literature for $q \neq 0$ and/or when the series
is not stationary and, largely because of this, no simple method has been
available for tentative identification of mixed ARMA models.

In [18], we are able to show the following results. Let
$\phi(B) = U(B) \phi_1(B)$, where $U(B) = 1 - U_1 B - \dots - U_d B^d$ has
all its zeros on the unit circle and $\phi_1(B)$, a polynomial of degree $r$
in $B$, has all its zeros outside the unit circle. Then
(a) the OLS estimates of an AR(d) fitting are consistent for the parameters
$U_1, \dots, U_d$, and (b) the OLS estimates of an AR(d+r) fitting are
consistent for $\phi_1, \dots, \phi_p$ in (1) only when $q = 0$.
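
The restriction to $q = 0$ in (b) can be seen in the simplest stationary
case. For an ARMA(1,1) series the OLS estimate from an AR(1) fitting
converges to the lag-one autocorrelation
$\rho_1 = (1 - \phi\theta)(\phi - \theta) / (1 + \theta^2 - 2\phi\theta)$,
which equals $\phi$ only when $\theta = 0$. The sketch below checks this by
simulation; it is a textbook illustration rather than the argument of [18],
and the function names, seed and sample size are arbitrary choices.

```python
import random


def ols_ar1(z):
    """OLS slope from regressing z_t on z_{t-1} (no intercept)."""
    num = sum(z[t] * z[t - 1] for t in range(1, len(z)))
    den = sum(z[t - 1] ** 2 for t in range(1, len(z)))
    return num / den


def arma11_lag1_autocorr(phi, theta):
    """Theoretical lag-one autocorrelation of a stationary ARMA(1,1) series."""
    return (1 - phi * theta) * (phi - theta) / (1 + theta ** 2 - 2 * phi * theta)


def simulate_arma11(phi, theta, n, seed=0):
    """Simulate z_t = phi * z_{t-1} + a_t - theta * a_{t-1}, zero start-up."""
    rng = random.Random(seed)
    series, z_prev, a_prev = [], 0.0, 0.0
    for _ in range(n):
        a = rng.gauss(0.0, 1.0)
        z_prev = phi * z_prev + a - theta * a_prev
        a_prev = a
        series.append(z_prev)
    return series
```

With `phi = 0.8` and `theta = 0.5`, `arma11_lag1_autocorr` gives 0.4, and
the OLS estimate from a long simulated series settles near that value
rather than near 0.8.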

In [19], an iterated regression method is proposed which yields
consistent estimates of $\phi_1, \dots, \phi_p$ for known $q$. Since the
residuals from the iterated regression approximately follow a moving average
model of order $q$, this has made it possible to define a class of extended
autocorrelations as a function of $p$ and $q$, for tentative specification
of the order of a mixed ARMA model. Extension to the vector ARMA model (2)
is currently being investigated.

(vi) Pooling of time series

In recent years, there has been a great deal of interest in adopting
Bayesian methods for the estimation of means and variance components in
random effect models. In [20], such a Bayesian random effect model
approach has been applied to the analysis of panel data in a time series
context. A detailed analysis is given of a panel of first order
autoregressive models where the autoregressive coefficients across the panel
are regarded as a random sample from a beta distribution. The efficiency of
pooling the estimates is discussed and illustrated by a real example.

(vii) Effect of outliers in time series

In practice, data often contain one or more spurious observations, but
these may not be easy to recognize. It is important that statistical
methods be devised which guard against the adverse effects of spurious
observations. Methods proposed in the literature for doing this have
frequently made the assumption that the errors are independent, which is
unrealistic for time series data. In [21], rules have been proposed to
deal with outliers for a first order autoregressive process. The efficacy
of the rules is studied in relation to estimating the mean and the
autoregressive coefficient.

(viii) Science and statistics

An important question facing scientists and statisticians is the
precise role that statistics plays in scientific investigation. The history
of science shows that scientific progress is achieved largely through a
continued iteration between theory and practice. In [22], a Fisher
Memorial Lecture to the American Statistical Association, the work of
R.A. Fisher is used to illustrate the symbiotic relation of practice and
theory and to show how rapidly progress can occur when the interactive
process is allowed to operate. It is argued that in order to help solve
real problems, statistical methods should be developed in this mode.

(ix) Robustness in statistical analysis

An approach to robustification by appropriately modifying the
model and using standard Bayes estimation is further studied in [25].
In [23] the downweighting patterns applied to "outliers" resulting from
such a treatment are compared with those resulting from the Huber-Tukey-
Andrews approach, in which the method of estimation rather than the
model is modified.

In much of the literature on robust estimation it is tacitly implied
that errors behave as if randomly drawn from some heavy-tailed
distribution. Examination, in [24], of several sets of classical
scientific data supposed to support this contention suggests a different
situation. It seems that the errors are represented by a roughly normal
distribution with a given mean and variance for one period of time, and
then at a later period of time the variance and/or the mean change to some
other values. This will occur, for example, when pilot experimentation
during which the apparatus is being "broken in" is followed by a series
of reasonably homogeneous runs during which the apparatus is operating well.
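
A small calculation shows how such a change in variance can imitate heavy
tails when the two periods are pooled. If a fraction $w$ of zero-mean
normal errors has variance $v_1$ and the rest has variance $v_2$, the
pooled kurtosis is $3 (w v_1^2 + (1-w) v_2^2) / (w v_1 + (1-w) v_2)^2$,
which exceeds the normal value of 3 whenever $v_1 \neq v_2$. The sketch
below is our own arithmetic illustration, assuming equal means in both
periods; it is not taken from [24].

```python
def pooled_kurtosis(weight, var_first, var_second):
    """Kurtosis (normal value 3) of pooled zero-mean normal errors.

    `weight` is the fraction of observations from the first period.
    """
    second_moment = weight * var_first + (1 - weight) * var_second
    # The fourth moment of a zero-mean normal variable is 3 * variance^2.
    fourth_moment = 3.0 * (weight * var_first ** 2
                           + (1 - weight) * var_second ** 2)
    return fourth_moment / second_moment ** 2
```

For instance, if one fifth of the runs have a standard deviation three
times that of the rest (`pooled_kurtosis(0.2, 9.0, 1.0)`), the pooled
kurtosis is about 7.5, although the errors within each period are exactly
normal.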

(x) A unified theory of statistical criticism and inference

In [26] it is argued that two different kinds of inference are
needed in scientific iteration: criticism, in which the compatibility of
model and data is questioned, and estimation, in which the implications of
model and data are combined assuming compatibility. The view is
advanced that ultimately criticism must rest on a sampling theory argument
using the predictive distribution as reference, whereas estimation should be
done using the Bayes posterior distribution.
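
The distinction can be illustrated with the standard conjugate normal case,
assuming a normal prior for a mean and a known error variance. Criticism
refers the observed sample mean to its predictive distribution, here
normal with the prior mean and variance equal to the prior variance plus
$\sigma^2/n$; estimation uses the posterior distribution. The example and
its function names are ours and are meant only as a sketch of the idea in
[26].

```python
import math


def posterior(prior_mean, prior_var, ybar, sigma2, n):
    """Posterior mean and variance of a normal mean (known variance sigma2)."""
    data_var = sigma2 / n
    post_var = 1.0 / (1.0 / prior_var + 1.0 / data_var)
    post_mean = post_var * (prior_mean / prior_var + ybar / data_var)
    return post_mean, post_var


def predictive_tail_area(prior_mean, prior_var, ybar, sigma2, n):
    """Two-sided tail area of ybar under its prior predictive distribution."""
    pred_var = prior_var + sigma2 / n
    zscore = abs(ybar - prior_mean) / math.sqrt(pred_var)
    return math.erfc(zscore / math.sqrt(2.0))
```

A small tail area would call the model, including the prior, into
question; only if the model survives such criticism would the posterior
mean and variance be used for estimation.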

The implications for diagnostic checks, tests of fit, and robust and
shrinkage estimators are considered.